A Generative View of Ill - Formed Input Processing
نویسنده
چکیده
The key idea behind this work is that of a weighted grammar. For simplicity we consider here only a special case of the more general de nition given in [Teitelbaum 73], which could lead to other interesting variations (e.g. probabilities as weights, using multiplication instead of addition as below). We de ne a weighted grammar as a Context-Free (CF) grammar with a numeric weight attached to each of its rules. We attach to any derivation tree a weight that is the sum of the weights of all instances of rules used in that derivation.
منابع مشابه
A Rule-Based Approach To Ill-Formed Input
Though natural language understanding systems have improved markedly in recent years, they have only begun to consider a major problem of truly natural input: ill-formedness. Quite often natural language input is ill-formed in the sense of being misspelled, ungrammatical, or not entirely meaningful. A requirement for any successful natural language interface must be that the system either intel...
متن کاملParse Fitting and Prose Fixing: Getting a Hold on III-Formedness
Processing syntactically ill-formed language is an important mission of the EPISTLE system, lll-formed input is treated by this system in various ways. Misspellings are highlighted by a standard spelling checker; syntactic errors are detected and corrections are suggested; and stylistic infelicities are called to the user's attention. Central to the EPISTLE processing strategy is its technique ...
متن کاملViolable principles and typological variation
In the Government-based literature, all structural principles supplied by UG are, without exception, assumed to exert their influence over every well-formed representation. We do find cases, however, where two principles that appear to be universally applicable make opposing predictions as to the grammaticality of a given structure. To resolve such instances of principle clash, I propose a loca...
متن کاملReflections on the Knowledge Needed to Process Ill-Formed Language
This paper reflects about the kinds of morphological, syntactic, semantic, and pragmatic knowledge needed to process ill-formed input. We conclude that an excellent start on processing ill—formed input has been exemplified in a number of concrete implementations, but that a substantial amount of fundamental work must still be done if our systems are to understand language robustly to the degree...
متن کاملSchema method: a framework for correcting grammatically ill-formed input
The schema method is a framework for correcting grammatically ill-formed input. In a natural language processing system ill-formed input cannot be overlooked. A computer assisted instruction (CAD system, in particular, needs to show the user's errors. This framework diagnoses ill-formed input, corrects it and explains the error, if an input is ill-~'ormed. The framework recognizes a sentence at...
متن کامل